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Abstract: We propose a method for reconstructing the mass of a particle, such 
as the Higgs boson, decaying into a pair of r leptons, of which one subsequently 
undergoes a 3-prong decay. The kinematics is solved using information from the 
visible decay products, the missing transverse momentum, and the 3-prong r decay 
vertex, with the detector resolution taken into account using a likelihood method. 
The method is shown to give good discrimination between a 125 GeV Higgs boson 
signal and the dominant backgrounds, such as Z° decays to rr and W ± plus jets 
production. As a result, we find an improvement, compared to existing methods 
for this channel, in the discovery potential, as well as in measurements of the Higgs 
boson mass and production cross section times branching ratio. 
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1. Introduction 

The Standard Model Higgs boson - or something like it - has been found [1,2] and 
the race is now on to determine its detailed properties. Of particular interest is the 
rr decay channel, where, despite sensitivity to a signal comparable to that predicted 
in the Standard Model with rrih — 125 GeV, no evidence for a signal has been seen. 
On the contrary, the CMS collaboration observes a deficit in the 8 TeV LHC data, 
compared to the background prediction in the Standard Model without a Higgs. 1 

This may be evidence for physics beyond the Standard Model or it may be a 
statistical fluctuation. It may also be evidence that we are doing something wrong. 
We should dearly like to get to the bottom of the mystery, but progress is slowed by 
a number of complications in the h — > tt search channel. 

One complication is the problem of triggering and identifying r leptons, which 
decay in a variety of ways, some of which resemble common-or-garden QCD jets. A 
second is that, since a r decay always involves one or more invisible neutrinos, it is 
not possible to simply reconstruct the invariant mass of the rr system and thence 
to perform a standard resonance search. A third is the problem of distinguishing 
h — > tt signal events from the dominant backgrounds, namely Z — > tt decays 
(which, moreover, lie nearby in rr invariant mass) and production of QCD jets, with 
or without a W boson, in which other leptons or jets are mistakenly identified as r 
leptons. 

The third issue, of distinguishing the signal from backgrounds, is compounded by 
the second issue, that we are a priori unable to reconstruct the invariant mass of the 

1 ATLAS is yet to publish results for data taken at 8 TeV. 
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tt pair. As a result, and as is readily apparent from the data shown in [3], the signal 
and backgrounds sit 'on top of each other' in distributions of the observables that 
are currently available to us. Not only does this increase the integrated luminosity 
required to be sure that an apparent signal is not a mere background fluctuation, 
but it also increases our vulnerability to systematic errors. 

To be explicit, we have, roughly speaking, a signal (the Higgs) that consists of 
a narrow peak in invariant mass, a background (the Z) that is also a narrow peak, 
but centred elsewhere, and additional backgrounds (like W plus jets and QCD) that 
are approximately flat in invariant mass. The signals and backgrounds are otherwise 
roughly indistinguishable in their dynamics. Now, if the observables that we have 
available are uncorrelated with the invariant mass, then we are essentially reduced 
to counting events in order to try to discover the signal and our ability to do so is 
greatly limited by the total statistics available. We are, moreover, completely at the 
mercy of systematic uncertainties in the overall background normalization, which we 
have no way to measure in data. Even if we were able to make a discovery in this way, 
we could at most make one measurement of the signal properties (its overall size) 
and here too we would be exposed to the systematic uncertainty in the background 
normalization. 

Conversely, imagine that we find a way to reconstruct, more or less, the invariant 
mass. The first benefit is that we are no longer limited by the overall statistics, but 
rather by the number of signal and background events in a region of invariant mass 
of our choosing (near 125 GeV being the obvious choice for h — > tt). Moreover, we 
now have a clean separation between the signal and background and indeed between 
the different backgrounds themselves. This opens up the possibility of measuring 
the backgrounds using a data-driven approach, rather than relying on Monte-Carlo 
predictions. Indeed, a simple sideband analysis would suffice, in which the Z back- 
ground is measured in a 'control' region near 90 GeV and the other backgrounds are 
measured in a control region away from the peaks near 90 GeV and 125 GeV. Finally, 
independent measurements of the signal mass and cross-section times branching ratio 
become possible. 

Needless to say, the real situation is rather more complicated for h — > tt at the 
LHC, but the basic principle remains the same: the more we are able to separate the 
signal and the different background components from each other, the less we shall 
find ourselves at the mercy of statistical and systematic uncertainties. 

So, how could we reconstruct something like the tt invariant mass? Several 
approximate methods or observables have previously been suggested in the literature 
(see, for example, [4-9]). Some of these suffer from being rather poorly correlated 
with the invariant mass (some provide, for example, only an upper or lower bound 
on it), while others [4,8,9] suffer from the fact that they turn out to be ill-defined 
for a significant fraction of events. 

Here we wish to propose yet another method, which differs significantly in that 



-2- 



we focus on the subset of events in which a r lepton undergoes a 3-prong decay. This 
implies an immediate disadvantage in the form of a reduced number of signal events 
overall for a given integrated luminosity: a r lepton has a branching ratio of 15 % for 
a 3-prong decay (of which 9.3 % are to 7r~7r + 7r~z/ r and 4.6 % are to 7r~7r + 7r~7r°z/ r ) [10], 
meaning that only 28 % of di-r events feature at least one 3-prong decay. There is 
another disadvantage, which is that these events are typically more difficult to trigger 
on than other r decays. However, the hope is that the disadvantages are more than 
compensated by the advantages. 

These advantages all stem from the fact that the presence of a 3-prong decay 
allows us to reconstruct the rr invariant mass, if the invariant mass of the neutrino 
or neutrinos from the other r decay is known. Thus, for hadronic decays of the other 
r, we can fully reconstruct events (up to a discrete ambiguity and in the absence 
of detector mismeasurements, both of which we shall deal with below); for leptonic 
decays of the other r, we are able to partially reconstruct events. As a result we 
hope to benefit from a reduced exposure to statistical and systematic uncertainties 
as argued above. 

The extra kinematic information needed to reconstruct comes from the loca- 
tion of the secondary (3-prong r-decay) vertex: if one can measure with reasonable 
accuracy the impact parameter of each of the three charged tracks, defined as the 
shortest distance between the track and the primary vertex, then the intersection of 
these impact parameters gives the location of the secondary vertex. 2 

Now, the location of the secondary vertex tells us the direction of the r momen- 
tum; the mass-shell constraint for the r then allows to reconstruct the magnitude of 
the r momentum, up to a possible two-fold ambiguity. Given the measured missing 
transverse momentum, we are then able to reconstruct the momentum of the other r 
(up to a further possible two- fold ambiguity), provided we know the invariant mass 
of the neutrino(s) produced in the other r decay. 

The reconstruction process just described can only be expected to work if things 
are well measured. For example, if they are not, we may end up with no real solutions 
to the kinematic constraints. We account for this by defining an ad hoc likelihood 
function in which we convolute the observed quantities with a function parameter- 
izing the detector response. The maximum of this likelihood function is an event 
observable (albeit one with an obscure definition) and it is this observable that we 
propose to use for signal discrimination. The fact that we invoke a likelihood function 
also allows to deal with the unknown invariant mass of the two neutrinos produced 
in a leptonic r decay: we marginalize with respect to the unknown invariant mass, 
including the matrix element for the r decay. 

2 The fact that there are three intersections means that we have an indication of the quality of 
the vertex reconstruction in an individual event. This information could, in principle, be fed into 
the likelihood function that we shall use to account for detector response, but we do not do so here. 
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In fact, there is yet another advantage to focussing on 3-prong decays, which is 
that the fake backgrounds (coming from, e.g. W+ jets and QCD) will be reduced. 
The reasons for this are twofold: firstly, jets and other leptons are presumably less 
likely to fake a 3-prong decay (with a reconstructed secondary vertex) than they are 
to fake a generic hadronic r or leptonic r decay; secondly, even if a 3-prong decay 
is faked (or arises from a genuine r decay), background events of this type will not 
necessarily reconstruct a rr resonance. To put it another way, when we try to solve 
for resonance mass for such events using the kinematic constraints, we may find that 
there are no real solutions. 3 

The outline of the paper is as follows. In the next Section, we describe the 
algebraic details of the reconstruction procedure. In Section |3], we present the results 
of our numerical simulations and in Section ^, we draw our conclusions. 

2. The method 

As described in the introduction, events are reconstructed using the decay vertex 
information. If one of the r leptons decays to a 3-prong hadronic system and a r 
neutrino, then the displacement r from the primary interaction point to the r decay 
vertex should be measurable with useful precision. 4 

Let us begin by considering the limit of perfect detector resolution. Denoting 
the energy, momentum and mass of the 3-prong decay products by Ej, pj and rrij, 
respectively, and the angle between r and by 9, the momentum of that r lepton 
can be reconstructed, with a twofold ambiguity, as p T = p r r/|r|, where (neglecting 
the neutrino mass) 

(m 2 + m^)pj cos 9 ± Ej (m 2 — m 2 ) 2 — 4m 2 p 2 sin 2 9 
Pt = 2(m 2 +p] sin 2 9) 

The other r lepton may decay either hadronically or leptonically, into a visible 
system f (a hadronic jet or a charged lepton) and an invisible system i' (a r neutrino 
or a pair of neutrinos). The transverse momentum of the invisible system is found 
from the missing transverse momentum, , and the reconstructed momentum of 
the first r via 

PTi' = f T + PTj ~ PTt ■ (2.2) 

Given the invariant mass of the invisible system, m,/ = m u = for a hadronic and 
my = m vv > for a leptonic decay, one can then solve for the invisible longitudinal 

3 Unfortunately, it is not possible for us to reliably estimate the size of this effect, since neither 
the full details of the experimental r reconstruction algorithm nor the resulting efficiencies or fake 
rates for 3-prong decays are public. 

4 The ATLAS CSC [11] estimates a resolution of 0.6 and 0.01 mm in the directions parallel and 
perpendicular to the displacement, respectively. 
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(2.1) 



momentum, again with a twofold ambiguity: 



< = — (atpLji ± E jr -\/a 2 - jii'fiA , (2.3) 



PL 



where 



l^i'=PTi'+ m h = Pry + m ?' ' OL = ^{m\-m), -m-,) - piy ■ Pt? ■ (2-4) 

The momentum of the second r can now be reconstructed as p r / = pj' + pj>, and 
hence the invariant mass of the rr system, m TT is determined, up to a fourfold 
ambiguity. 

Now consider a real detector and let q = (r, Ej, pj, p^-, p ) correspond to the 
measured quantities. These do not coincide with their true values in an event, which 
we now denote by q, but rather are shifted by amounts depending on the detector 
resolution, which we describe by a response function, /(q, q). Then the likelihood, 
as a function of the true invariant mass m TT , for an event with measured quantities 
q, may be written as 

£(m TT |q) = J dq/(q, q) M(q) 5[m TT - m rr (q)] (2.5) 

where A^(q) is the matrix-element squared for the decay and m TT (q) is the invariant 
rr mass reconstructed from the true quantities q according to the recipe described 
above. Here .M(q) should also include the jacobian factor relating the final-state 
phase space to the quantities q. We find, in most cases, that including these ef- 
fects gives, at best, a marginal improvement in the mass resolution. Indeed, some 
effects (such as the exponential distribution of the r-decay lifetimes), lead to large 
fluctuations in the likelihood integrand and hence to large errors in the numerical 
integration, worsening the mass resolution. Thus we do not include these effects, in 
general. 

There is, however, one such effect which we do include. In the case of leptonic 
decay of the second r, the matrix elements also depend on the momenta of the two 
invisible neutrinos, and the right-hand side of eq. ( |2.5| ) should include an integration 
over their phase space, weighted by the expected distribution of the vv invariant 
mass. This is conveniently expressed as P{m^ v ) d$ vu , with 

d ^= 2(2^)3^(4^)3 ^- (2 ' 6) 
where dQ* is the element of solid angle in the vv centre-of-mass frame and 

mi \ mi I \ mi I 



- 5 - 



At each phase-space point, the value of m vu is then used, with this weight, for the 
reconstruction of the decay. 

In the integral over the delta function in eq. (|2.5|) , we include all real solutions to 
the equation rh TT = m TT (q). In contrast to [7], complex solutions should be discarded 
here, since they cannot correspond to true values for genuine rr resonance events. 
Note, however, that measured values q that would correspond to complex values of 
m TT lying close to the real axis if reconstructed directly, which correspond to real 
solutions shifted slightly by detector resolution, will be included in the integration at 
neighbouring values of q. For fake backgrounds, we often find that no nearby values 
of q lead to real solutions, allowing the event to be rejected. 

We perform the integrations in eq. ( [2.5|) by a Monte Carlo method similar to 
that adopted in [12], generating a large number of points q distributed around each 
measured point q according to a smearing function /(q, q) deduced from detector 
simulations. The jet masses are generated according to certain probabilities that we 
describe in the Appendix. Each real solution m TT = m rT (q) is entered into a his- 
togram with the corresponding weight. Because the Monte Carlo method generates 
only a finite number of points, all histogram bins are given a small positive offset, to 
avoid multiplications by zero. 

Since our likelihood function does not encode the matrix element in its entirety, 
we cannot expect to be able to make statistical inferences directly from it in the usual 
way. Doing so might lead, for example, to us wrongly rejecting the Standard Model 
Higgs boson hypothesis, or obtaining a biased measurement of its mass. Instead, we 
use our ad hoc likelihood function to define an event observable in the following way: 
for each event, we extract the smallest value of m TT that gives a local maximum of 
the event likelihood and define this to be the event value of the observable mgy- 5 
Our simulations suggest that this observable gives distributions for Higgs and Z 
boson event samples whose peak locations provide an unbiased determination of the 
corresponding boson mass, with small tails. In any case, the presence of such effects 
can be mitigated by comparing experimental distributions of observables to template 
Monte-Carlo samples, as we do in simulations of pseudo-experiments below. 

3. Simulations and results 

Our study is based on a sample from the Herwig++ event generator [13], version 
2.52 [14] and corresponds to an integrated luminosity of 20 fb -1 , which should be 
achieved at the LHC before the end of 2012. 

As regards the signal, recent Standard Model predictions for Higgs production at 
the LHC may be found in Ref. [15]. For a Higgs mass of 125 GeV, at a collision energy 

5 Since we have to solve a quartic equation, and since real roots thereof come in pairs, we 
invariably hnd multiple local maxima in the event likelihood. 
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3pr-had 


3pr-lep 


had- had 


had-lep 


H 


6.3 


5.7 


17.2 


28.5 


Z 


528.8 


416.5 


1451.4 


2036.0 


Wj 


50.5 


32.4 


135.3 


161.9 



Table 1: The cross section times efficiency of each process/channel in fb. 

of 8 TeV, the expected total cross section is 19.52 pb in the gluon fusion channel and 
1.58 pb in the vector boson fusion channel, with a probable uncertainty of around 
10%. The predicted SM branching ratio for rr decay, also given in Ref. [15], is 6.4%. 
Higgs production followed by rr decay thus corresponds to a cross section of 1.34 
pb. 

For the Z background, CMS [16] reports a flavour-averaged prediction of a(pp — > 
Z — > 11) = 1.13 nb and a measurement of 1.12 nb for the 8 TeV LHC. We take 
a{pp — t- Z — )■ rr) = 1.13 nb. For the W + j background, we use a{pp — > Wj) = 2.15 
nb, taken from Herwig++. 

All of our simulations are carried out at the parton level, without showering 
or hadronization effects, apart from hadronic tau decays which we simulate using 
Herwig++ [17]. The detector response was modelled as follows. 

Firstly we assume identification efficiencies of 0.4 and 0.3 for the 1- and 3-prong 
hadronic taus, respectively [18]. The lepton identification efficiency is assumed to 
be 0.9 in our analysis. To estimate the number of W + j events in which the jet 
mimics a 1- or 3-prong hadronic tau, we use the fake rates of 0.01 and 0.002 for 1- 
and 3-prong taus, respectively [19]. 

Secondly, we parameterize detector mismeasurements by smearing the energy 
component of jets and leptons with a(E)/Ej = 0.5 GeV^/-^/^ and a(E)/Ei = 
0.05 GeV 5 / y/E~e, respectively. For the missing transverse momentum, we smear each 
component with a x = a y = 5 GeV. 

For the r decay vertex, the Monte-Carlo truth position is smeared by Gaussian 
distributions of widths 0.613 ±0.008 mm and 10.5 ±0.2 /im, in the directions parallel 
and perpendicular to the 3-prong tau-jet, respectively. For jets that fake taus, we 
take the truth vertex position to be zero and then smear as above. 

We then apply event selection cuts to purify the signal, for which we impose 

p Tj > 20 GeV, p T y > 20 GeV, f T > 20 GeV. (3.1) 

The cross section for each process/channel after taking account of the efficiencies of 
(mis) identification and the selection cuts is listed in Table |l| 

In Figure [I] we compare the signal and background distributions of our variable 
msv with the existing variables m e ff and m v i S . m v i S is simply the invariant mass 
of the visible products of both r decays, while m e ff includes the missing transverse 
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Figure 1: Signal and background distributions of msv (top), m v i s (centre) and m c g 
(bottom), for an integrated luminosity of 20 fb _1 at the 8 TeV LHC, for rrih = 125 GeV. 
Left: lepton-hadron modes; right: hadron-hadron modes. 



momentum (an explicit definition may be found in [8]). We show results for the 
lepton-hadron modes and hadron-hadron modes separately. The better separation 
between signal and backgrounds that we expected to obtain using msv is clear to see 
in the Figure. Indeed, m vis has distributions for the Higgs signal and Z background 
which are strongly peaked, but the two peaks sit on top of each other. m eff incorpo- 
rates extra information in the form of the missing transverse momentum, and slightly 
increases the separation between the maxima of the Higgs and Z boson peaks, but 
at the cost of introducing large tails (from the smearing of the missing transverse 
momentum measurement). As a result, the Higgs signal is easily hidden in the large 
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tail of the Z background. Moreover, the shapes of all components become similar, 
making discrimination difficult when the overall normalizations are uncertain. In 
contrast msv provides a good separation between the narrow Higgs and Z boson 
peaks. These peaks are, furthermore, very different in shape from the continuum 
W+ jet background. 

Nevertheless, it is also apparent that the statistics available using msv are lower 
than for the other variables, even in the region of maximum signal. We need, there- 
fore, to make a quantitative comparison of the three variables. To do so, we generate 
distributions of them (for m^ = 125 GeV signal and backgrounds) in ten pseudo- 
experiments, each corresponding to an integrated luminosity of 20 fb _1 of 8 TeV 
LHC data. Each pseudo-experiment is then compared to template model distribu- 
tions with different values of rrih and with different normalization factors fh, fw, an d 
fz for the Higgs signal and W + j and Z backgrounds, respectively. (The values 
fh,w,z = 1 correspond to the leading order Monte-Carlo prediction.) Allowing the 
model distribution normalizations to float in this way allows us not only to take into 
account some of the most important systematic effects (arising from the uncertainties 
in the luminosity, the Monte-Carlo predictions, and data-driven extrapolations), but 
also provides a means to measure the cross-section times branching ratio for Higgs 
production followed by decay to tt. 

We make a cut on the observable of interest itself, so as to maximize its discovery 
potential. Roughly speaking, this cut selects the region that contains the bulk of the 
signal events for that observable. Thus we choose m sv G [110, 150] GeV, m vis G 
[75, 110] GeV, and m eff G [100, 150] GeV. 

We then compute, for each pseudo-experiment, a binned Poisson log-likelihood, 
log£ = Ej log C p (n* nal ; Xij , where n* nal is the number of events observed in his- 
togram bin i in a given pseudo-experiment and X{ = Xi{m h , f h , f z , fw) is the number 
of events expected in a given model, parameterized by rrih, fh, fz, fw- 

To assess the discovery potential, we then compute the difference in log-likelihood 
between models with and without a Higgs signal. In the model without a signal, 
we maximize the log-likelihood with respect to fz, and fw, whereas in the model 
with a signal, we additionally maximize with respect to rrih and fh- In Fig. |[ we 
show —2 times the difference in log-likelihood for the three variables. The centre 
of each bar shows the mean value over trials, while the width of each bar gives 
the root-mean-square deviation over trials. We observe a significant improvement 
using msv i n the hadron-hadron channel, along with a more modest improvement 
(compared to m vis ) in the leptonic channel. The absolute values of the discovery 
significance are exaggerated, since we have neglected sub-dominant backgrounds and 
many uncertainties, but the relative performance of the different variables should be 
meaningful. 

To assess the expected resolution in the Higgs mass measurement, we first show, 
in Fig. H the variation in — 21og£, averaged over pseudo-experiments, for values 
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Figure 2: Discovery potential using msv, compared to m e g and m v i s , for lepton-hadron 
(left) and hadron-hadron (right) modes, for an integrated luminosity of 20 fb _1 at the 8 
TeV LHC. The centre of each bar shows the mean value over trials, while the width of each 
bar gives the root-mean-square deviation. 



of the model Higgs mass, rrih, in the neighbourhood of 125 GeV, after maximizing 
C with respect to fh, fz, and fw- In the figure, we have placed the minimum of 
— 21og£ at height zero. For each variable, we estimate the precision of the mass 
measurement from a quadratic fit to the log-likelihood. The resulting fractional 
uncertainties are shown in Fig. [| and are seen to be much smaller for msv than for 
the other two observables, in both hadron-hadron and lepton-hadron modes. This is 
easily explained by the fact that the Higgs signal peak is sharpest for msv- For the 
measurement of the product of the production cross section and the branching ratio 
for the decay, the procedure is exactly analogous, except that now we vary f h , after 
maximizing with respect to m^. The resulting fit and fractional uncertainties are 
shown in Figs. [| and |6], respectively. Again, there is always an improvement when 
using msv, though the gain is greater in the hadron-hadron channel. 



4. Conclusions 

As shown in Figs. our simulations suggest that a significant improvement in dis- 
covery potential, Higgs boson mass resolution, and measurement of production cross 
section times branching ratio can be obtained by focussing on 3-prong r decays. The 
performance of msv is roughly comparable irrespective of whether the other r meson 
decays leptonically or hadronically, leading to greater gains in the hadronic-hadronic 
channel, where the other observables perform more poorly. One possible reason for 
this is that the detector resolution is invariably poorer in this channel, which contains 
extra jets. Thus the performance of m vis and m eff is degraded. However, these events 
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Figure 3: The log-likelihood function near = 125 GeV obtained using the mgy, m e f[ 
and m v ; s distributions, for lepton-hadron (left) and hadron-hadron (right) modes, for an 
integrated luminosity of 20 fb _1 at the 8 TeV LHC. 
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Figure 4: Fractional uncertainty in the Higgs boson mass, as measured using the mgv, 
m e ff and m v ; s distributions, for lepton-hadron (left) and hadron-hadron (right) modes, for 
an integrated luminosity of 20 fb _1 at the 8 TeV LHC. 



are also fully reconstructible using the vertex information, in the absence of smear- 
ing. As a result, the likelihood that defines the observable m sv is able to correct 
for the extra smearing to a certain extent, by insisting that the unsmeared quantites 
consistently reconstruct the event. 

The gains are greatest for the mass measurement, which is perhaps not surprising 
since our method provides a means to reconstruct the mass whilst partially correcting 
for the uncertainties that are introduced by the detector resolution. 
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fh f h 

Figure 5: The log-likelihood as a function of the signal strength fh, obtained using the 
msv; m eS and m v i a distributions, for lepton-hadron (left) and hadron-hadron (right) modes, 
for an integrated luminosity of 20 fb _1 at the 8 TeV LHC. fh = 1 corresponds to the truth 
value. 
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Figure 6: Resolution for measurement of the production cross section times branching 
ratio for pp — > h — > tt for nih using mo,y, compared to m e g- and m v ; s , for lepton-hadron 
(left) and hadron-hadron (right) modes, for an integrated luminosity of 20 fb _1 at the 8 
TeV LHC. 



The results of our simulations are encouraging, but they should be taken with 
a pinch of salt. The simulations themselves are rudimentary, and we have only 
performed a comparison with the basic variables m e g and m v ; s . Both collaborations 
now employ more sophisticated likelihood-based analyses. Unfortunately the full 
details of these have not been made public, so it is difficult for us to make a fair 
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comparison. CMS does say that their likelihood method gives an improvement of 
around 25% in mass resolution compared to using m e ff , and so we expect that further 
gains can still be made by incorporating msv into the current analysis framework. 

We have also not made a full study of the backgrounds, of which many are 
relevant for this search. However, the two backgrounds we did consider are very 
different in their nature (one being a genuine, resonant background and the other 
being a fake, continuum background). We hope therefore, that the other backgrounds 
will be similar to one or other of these in their behaviour. The recent CMS results 
suggest, moreover, that our two backgrounds are the dominant ones in the signal 
region in most of the rr sub-channels (along with pure jets, which we expect to be 
similar to W+jets). 

Finally, we have only considered the most obvious systematic effect, namely 
the uncertainty associated with the normalization of the signal and backgrounds. 
Nevertheless, our qualtitative argument that a better mass reconstruction gives a 
better separation between the signal and the different backgrounds, means that many 
systematic uncertainties are expected to be reduced. 

We hope, at least, that our qualitative arguments and quantitative simulations 
are enough to convince the collaborations to explore the suitability of this method. 
Even if it then turns out that a significant improvement is not obtained using our 
method alone, we remark that an overall improvement can still be expected if it is 
combined with existing approaches. Our method is complementary and, as is clear 
from Fig [7], the observable we extract is not strongly correlated with the existing 
variables m eff and m vis . It thus provides independent information and may be used 
to increase the significance of searches in the rr channel. Our simulations apply to 
the current 8 TeV run and our hope is that application of our method to data being 
produced now will allow us to clear up the mystery of the observed deficit of h —> tt 
decays. Nevertheless, we expect our method to become even more relevant in the 
subsequent stages of the LHC programme. For one thing, the method is limited by 
statistics, but gains in reducing systematic uncertainties. It is the latter that will 
limit our ultimate ability to make precision measurements of the Higgs sector at the 
LHC. What is more, both ATLAS and CMS are planning upgrades or replacements 
of their vertex detectors, with an improvement of a factor of a few expected in the 
vertex resolution. The associated improvement in the rr mass reconstruction using 
our method should reduce the uncertainties even further. 
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A. The treatment of jet masses 

As we discussed in section [2|, in evaluating the likelihood function Q2.5|) , we generate 
a large number of points q in which the jet energy is smeared, as well as the other 
observables, according to the detector resolution. The magnitude of the jet momen- 
tum is then calculated as pj = J E 2 - — m 2 , where nij is the jet mass. We generate 
the jet mass according to the following probabilities: 

Pipximj) = G{rrij] m p , a p ) ■ (1 - R T ^ U ) + 5{rrij - m^) ■ R T ^ U , (A.l) 

for the 1-prong tau-jet and 

A P r(«ij) = G(mj\ m a , a a ), (A. 2) 

for the 3-prong tau-jet, where R t ^ttu is the BR(t — > txv) divided by the branching ra- 
tio of inclusive 1-prong tau decays and G(x; /i, a) is Gaussian probability distribution 
with mean value [i and standard deviation a. We took m p = 775 MeV, a p = 90 MeV, 
m a = 1230 MeV and a a = 160 MeV in our analysis. 

We have checked that the treatment of the jet mass does affect the frequency 
of finding real solutions in the likelihood evaluation, but does not affect much the 
overall shape of the likelihood function. Thus, msv is rather robust against variation 
of the jet mass. To demonstrate this, we show two msv distributions for the h — > tt 
events in the hadronic-hadronic channel in Figure [8], where one uses the jet mass 
described above and the other uses massless jets, with P lpr (mj) = Pz V x{ m j) oc S(m,j). 
As can be seen, the two distributions are very similar. 
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